Methods for the reduction of stutter in microsatellite amplification

ABSTRACT

The invention provides a method for reducing stutter in the amplification of a microsatellite comprising the steps of providing a sample comprising a microsatellite having a G+C content of 50% or less; contacting the sample with at least one enzyme having nucleic acid polymerase activity; and incubating the sample with the enzyme for a sufficient amount of time and under conditions sufficient to amplify the microsatellite; wherein the incubation is performed in the presence of an amount of betaine, sorbitol or mixtures thereof, effective to reduce stutter relative to the amount of stutter observed in the absence of betaine and/or sorbitol. The invention also provides compositions containing betaine and/or sorbitol, kits for amplifying microsatellites having a G+C content of 50% or less, and methods of using all of the foregoing.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 12/769,577, filed Apr. 28, 2010, which is a continuation of U.S. application Ser. No. 11/742,524, filed Apr. 30, 2007 (now abandoned), which is a division of U.S. application Ser. No. 10/637, 466, filed Aug. 8, 2003 (now U.S. Pat. No. 7,211,385, issued May 1, 2007), which is a division of U.S. application Ser. No. 09/850,514, filed May 7, 2001 (now U.S. Pat. No. 6,841,349, issued Jan. 11, 2005), which disclosures are herein incorporated by reference in their entirety.

FIELD OF THE INVENTION

The invention relates to methods, compositions and kits for reducing stutter in polymerase chain reaction amplification of microsatellites. In certain embodiments, the invention relates to the use of sorbitol and/or betaine in polymerase chain reactions in an amount effective to reduce stutter in the amplification of mononucleotide, dinucleotide, trinucleotide, tetranucleotide, and pentanucleotide microsatellites.

BACKGROUND OF THE RELATED ART

Microsatellites, or short tandem repeats (STRs), consist of tandemly repeated DNA sequence motifs of 1 to 6 nucleotides in length. They are widely dispersed and abundant in the eukaryotic genome, and are often highly polymorphic due to variation in the number of repeat units. This polymorphism renders microsatellites attractive DNA markers for genetic mapping, medical diagnostics, and forensic investigation. The combination of PCR and gel or capillary electrophoresis under denaturing conditions has greatly improved the genotyping of microsatellite DNA sequences. However, PCR artifacts exhibited by non-proofreading enzymes and referred to as stutter and the terminal transferase side-reaction can complicate analysis of closely spaced microsatellite alleles.

Stutter signals differ from the PCR product representing the genomic allele by multiples of repeat unit size. For dinucleotide repeat loci, the prevalent stutter signal is generally two bases shorter than the genomic allele signal, with additional side-products that are 4 and 6 bases shorter. The multiple signal pattern observed for each allele especially complicates interpretation when two alleles from an individual are close in size (e.g., medical and genetic mapping applications) or when DNA samples contain mixtures from two or more individuals (e.g., forensic applications). Such confusion is maximal for mononucleotide microsatellite genotyping, when both genomic and stutter fragments experience one-nucleotide spacing.

All previous methods of analyzing mononucleotide, A repeat, “BAT” alleles have generated multiple stutter signals, frustrating accurate genotype determination. These studies have been able only to determine a size range for each allele.

There is a need in the art to develop PCR reaction conditions that minimize or eliminate stutter so that genetic analysis may be more accurate and reliable. This invention is directed to these, as well as other, important ends.

SUMMARY

In accordance with some embodiments of the methods of the invention, methods for reducing stutter in the amplification of a microsatellite are provided comprising the steps of:

(a) providing a sample comprising a microsatellite of interest, in which the microsatellite has a G+C content of 50% or less;

(b) contacting the sample with at least one enzyme having nucleic acid polymerase activity; and

(c) incubating the sample with the enzyme for a time and under conditions sufficient to amplify the microsatellite; wherein said incubation is performed in the presence of an amount of an additive effective to reduce stutter relative to the amount of stutter observed in the absence of the additive; wherein the additive is selected from the group consisting of betaine, sorbitol and mixtures thereof.

The invention also provides methods for reducing stutter in the amplification of a mononucleotide microsatellite comprising the steps of:

(a) providing a sample comprising a microsatellite of interest, in which the microsatellite has a G+C content of 50% or less;

(b) contacting the sample with at least one enzyme having nucleic acid polymerase activity; and

(c) incubating the sample with the enzyme for a time and under conditions sufficient to amplify the microsatellite; wherein the incubation is performed in the presence of an amount of an additive selected from the group consisting of sorbitol, betaine and mixtures thereof, wherein the additive is effective to reduce stutter relative to the amount of stutter observed in the absence of the additive.

The invention also provides methods for reducing stutter in the amplification of a dinucleotide microsatellite comprising the steps of:

(a) providing a sample comprising a microsatellite of interest, in which the microsatellite has a G+C content of 50% or less;

(b) contacting the sample with at least one enzyme having nucleic acid polymerase activity; and

(c) incubating the sample with the enzyme for a time and under conditions sufficient to amplify the microsatellite; wherein the incubation is performed in the presence of an amount of an additive selected from the group consisting of sorbitol, betaine and mixtures thereof, wherein the additive is effective to reduce stutter relative to the amount of stutter observed in the absence of the additive.

The invention further provides methods for reducing stutter in the amplification of a trinucleotide microsatellite comprising the steps of:

(a) providing a sample comprising a microsatellite of interest, in which the microsatellite has a G+C content of 50% or less;

(b) contacting the sample with at least one enzyme having nucleic acid polymerase activity; and

(c) incubating the sample with the enzyme for a time and under conditions sufficient to amplify the microsatellite; wherein the incubation is performed in the presence of an amount of an additive selected from the group consisting of sorbitol, betaine and mixtures thereof, wherein the additive is effective to reduce stutter relative to the amount of stutter observed in the absence of the additive.

The invention further provides methods for reducing stutter in the amplification of a tetranucleotide microsatellite comprising the steps of:

(a) providing a sample comprising a microsatellite of interest, in which the microsatellite has a G+C content of 50% or less;

(b) contacting the sample with at least one enzyme having nucleic acid polymerase activity; and

(c) incubating the sample with the enzyme for a time and under conditions sufficient to amplify the microsatellite; wherein the incubation is performed in the presence of an amount of an additive selected from the group consisting of sorbitol, betaine and mixtures thereof, wherein the additive is effective to reduce stutter relative to the amount of stutter observed in the absence of the additive.

The invention also provides methods for reducing stutter in the amplification of a pentanucleotide microsatellite comprising the steps of:

(a) providing a sample comprising a microsatellite of interest, in which the microsatellite has a G+C content of 50% or less;

(b) contacting the sample with at least one enzyme having nucleic acid polymerase activity; and

(c) incubating the sample with the enzyme for a time and under conditions sufficient to amplify the microsatellite; wherein the incubation is performed in the presence of an amount of an additive selected from the group consisting of sorbitol, betaine and mixtures thereof, wherein the additive is effective to reduce stutter relative to the amount of stutter observed in the absence of the additive.

In further embodiments of the methods of the invention, methods are provided comprising the steps of:

(a) providing a sample comprising a nucleic acid that contains one or more microsatellites selected from the group consisting of mononucleotide microsatellites, dinucleotide microsatellites, trinucleotide microsatellites, tetranucleotide microsatellites and pentanucleotide microsatellites; and

(b) amplifying at least one nucleobase sequence of said nucleic acid, said nucleobase sequence comprising at least one of said microsatellites; said amplified microsatellite having a G+C content of 50% or less; wherein said amplification is performed in the presence of an additive selected from the group consisting of betaine, sorbitol and mixtures thereof.

Also provided in accordance with the present invention are methods for performing polymerase chain reaction amplification of a microsatellite selected from the group consisting of mononucleotide microsatellites, dinucleotide microsatellites, trinucleotide microsatellites, tetranucleotide microsatellites, and pentanucleotide microsatellites, said microsatellite having a G+C content of 50% or less; said method comprising the step of contacting said microsatellite with a polymerase in the presence of an amount of an additive effective to reduce the amount of stutter arising from said amplification relative to the amount of such stutter observed in the absence of said additive, wherein the additive is selected from the group consisting of sorbitol, betaine and mixtures thereof.

Also provided by the present invention are methods of detecting cancer or a pre-cancerous condition and genetic disorders, such as Friedreich's ataxia, in a subject comprising amplifying a region of DNA from a subject, wherein said region comprises a microsatellite selected from the group consisting of a mononucleotide repeat, a dinucleotide repeat, a trinucleotide repeat, a tetranucleotide repeat, and a pentanucleotide repeat, wherein said amplification comprises the steps of:

(a) providing a sample comprising a nucleic acid that contains a nucleic acid having a microsatellite instability,

(b) amplifying at least one nucleobase sequence of the nucleic acid, in which the nucleobase sequence comprises at least one of the microsatellites; and

(c) detecting alterations of the microsatellite as compared to corresponding microsatellites amplified from control tissue; the amplified microsatellite having a G+C content of 50% or less; wherein the amplification is performed in the presence of a sufficient amount of an additive to reduce stutter from that observed in the absence of the additive, wherein the additive is selected from the group consisting of sorbitol, betaine and mixtures thereof.

In some embodiments, the cancer or cancerous condition is colon cancer (HNPCC), or breast cancer. In further embodiments, the microsatellite amplification comprises at least one genetic locus, for example, (A)_(n).

In some embodiments, the present invention also provides methods of genetic mapping, comprising amplifying a plurality of regions of DNA from a sample containing DNA from a subject, wherein the regions comprise at least one microsatellite selected from the group consisting of a mononucleotide repeat, a dinucleotide repeat, a trinucleotide repeat, a tetranucleotide repeat, and a pentanucleotide repeat, wherein the amplification comprises the steps of:

(a) contacting said DNA with a enzyme at least one enzyme having nucleic acid polymerase activity; and

(b) incubating said sample with the enzyme for a time and under conditions sufficient to amplify the regions; and

(c) separating amplified regions, forming a microsatellite pattern; wherein the incubation is performed in the presence of an amount of an additive effective to reduce stutter relative to the amount of stutter observed in the absence of the additive, wherein the additive is selected from the group consisting of sorbitol, betaine and mixtures thereof.

In further embodiments, the present invention also provides methods of personal genetic identification comprising amplifying a plurality of regions of DNA from a sample containing DNA from a subject, wherein said regions comprise at least one microsatellite selected from the group consisting of a mononucleotide repeat, a dinucleotide repeat, a trinucleotide repeat, a tetranucleotide repeat, and a pentanucleotide repeat; wherein the microsatellite has a G+C content of 50% or less; wherein the amplification comprises the steps of:

(a) contacting said DNA with a enzyme at least one enzyme having nucleic acid polymerase activity; and

(b) incubating the sample with the enzyme for a time and under conditions sufficient to amplify the regions;

(c) separating amplified regions, forming a microsatellite pattern; and

(d) comparing the microsatellite pattern with a corresponding microsatellite pattern derived from the a DNA sample from a second source; wherein the incubation is performed in the presence of an amount of an additive effective to reduce stutter relative to the amount of stutter observed in the absence of the additive; wherein the additive is selected from the group consisting of sorbitol, betaine, and mixtures thereof.

In some embodiments, the subject is a forensic sample and a said second source comprises at least one selected from the group consisting of the presumed matching source, a family member of the presumed matching source, and a database of sources.

In some embodiments of the methods of the invention, where the microsatellite is a dinucleotide microsatellite, the microsatellite comprises a dinucleotide repeat selected from the group consisting of CA/TG and CT/AG. In further embodiments where the microsatellite is a trinucleotide microsatellite, the microsatellite comprises the trinucleotide repeat GAA/TTC. In further embodiments where the microsatellite is a tetranucleotide microsatellite, the microsatellite comprises a tetranucleotide repeat selected from the group consisting of TCTA/TAGA, AGAA/TTCT, AAGG/CCTT, AATG/CATT, TCTG/CAGA, and TAGG/CCTA. In further embodiments where the microsatellite is a pentanucleotide microsatellite, the microsatellite comprises the pentanucleotide repeat AAAGA/TCTTT.

In some embodiments of the method of the invention, the microsatellite has a G+C content of 50% or less. In further embodiments, the microsatellite has a G+C content of 40% or less. In further embodiments, the microsatellite has a G+C content of 30% or less. In further embodiments, the microsatellite has a G+C content of 20% or less. In further embodiments, the microsatellite has a G+C content of 10% or less.

In some embodiments of the invention, the amount of stutter is reduced to 90% or less than the amount of stutter obtained in the absence of sorbitol and/or betaine. In other embodiments the amount of stutter is reduced to 80% or less. In other embodiments, the amount of stutter is reduced to 70% or less. In other embodiments, the amount of stutter is reduced to 60% or less. In other embodiments, the amount of stutter is reduced to 50% or less. In other embodiments, the amount of stutter is reduced to 40% or less. In other embodiments, the amount of stutter is reduced to 30% or less.

In some embodiments of the methods of the invention, the amplification comprises contacting said nucleobase sequence with an enzyme having a polymerase activity. For example, the enzyme having polymerase activity may be selected from the group consisting of DNA polymerase from Thermus aquaticus, Thermus thermophilus, other Thermus species, Bacillus species, Thermococcus species, Thermotoga species, and Pyrococcus species. For example, suitable polymerases include AmpliTaq Gold® DNA polymerase; AmpliTaq® DNA Polymerase; AmpliTaq® DNA Polymerase, Stoffel fragment; rTth DNA Polymerase; rTth DNA Polymerase XL; Bst DNA polymerase large fragment from Bacillus stearothermophilus; Vent and Vent Exo—from Thermococcus litoralis; Tma from Thermotoga maritima; Deep Vent and Deep Vent Exo— and Pfu from Pyrococcus; and mutants, variants and derivatives thereof.

Also provided in certain embodiments of the invention are compositions comprising:

(a) a nucleic acid sequence comprising a microsatellite, in which the microsatellite has a G+C content of 50% or less, the microsatellite being selected from the group consisting of mononucleotide microsatellites, dinucleotide microsatellites, trinucleotide microsatellites, tetranucleotide microsatellites and pentanucleotide microsatellites;

(b) at least two primers, each of said primers having a sequence that is substantially complementary to a portion of the nucleic acid sequence that is adjacent to the microsatellite;

(c) at least one enzyme having nucleic acid polymerase activity; and

(d) an additive selected from the group consisting of betaine, sorbitol and mixtures thereof.

In some embodiments of the methods and compositions of the invention, sorbitol and/or betaine is present in an amount of from 1.5 to 3.5 M. In other embodiments, sorbitol and/or betaine is present in an amount of 2.0 to 3.0 M. In other embodiments, sorbitol and/or betaine is present in an amount of 2.0 M.

In some embodiments of the invention, at least 0.5 mM each of dNTPs are used. In other embodiments, at least 1 mM dNTPs are used.

In some embodiments, the present invention also provides kits for amplification of a target nucleic acid sequence, the target nucleic acid sequence comprising a microsatellite having a G+C content of 50% or less, selected from the group consisting of mononucleotide microsatellites, dinucleotide microsatellites, trinucleotide microsatellites, tetranucleotide microsatellites and pentanucleotide microsatellites, comprising, in separate containers: a polymerase, a plurality of deoxynucleotide triphosphates; and sorbitol and/or betaine. In some embodiments of the compositions and kits of the invention, the polymerase is selected from the group consisting of a DNA polymerase from Thermus aquaticus, Thermus thermophilus, other Thermus species, Bacillus species, Thermococcus species, Thermotoga species, and Pyrococcus species. For example, suitable polymerases include, but are not limited to, AmpliTaq Gold® DNA polymerase; AmpliTaq® DNA Polymerase; AmpliTaq® DNA Polymerase, Stoffel fragment; rTth DNA Polymerase; rTth DNA Polymerase XL; Bst DNA polymerase large fragment from Bacillus stearothermophilus; Vent and Vent Exo—from Thermococcus litoralis; Tma from Thermotoga maritima; Deep Vent and Deep Vent Exo— and Pfu from Pyrococcus; and mutants, variants and derivatives thereof.

In some further embodiments of the invention, methods are provided in which a sample containing nucleic acid that is suspected of containing one or more microsatellites having a G+C content of 50% or less is contacted with an enzyme that polymerizes nucleotides in the presence of an effective amount of betaine and/or sorbitol to reduce observed stutter relative to the amount of stutter observed in the absence of betaine and/or sorbitol, and amplifying at least one nucleobase sequence containing at least one microsatellite of the nucleic acid contained in the sample. Such microsatellites may include mononucleotide, dinucleotide, trinucleotide, tetranucleotide, and/or pentanucleotide microsatellites.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows GeneScan traces from PCR amplifications of a mononucleotide microsatellite ((A/T)_(n)) for control conditions (upper panel) and with added betaine (lower panel).

FIG. 2 shows GeneScan traces from PCR amplifications of a dinucleotide microsatellite ((CA/TG)_(n)) for control conditions (upper panel) and with added betaine (lower panel).

FIG. 3 shows GeneScan traces from PCR amplifications of a tetranucleotide microsatellite ((CTTT/AAAG)_(n)) for control conditions (upper panel) and with added betaine (lower panel).

FIG. 4 shows GeneScan traces from PCR amplifications of a mononucleotide microsatellite ((A/T)_(n)) for control conditions (upper panel) and with added sorbitol (lower panel).

FIG. 5 shows GeneScan traces from PCR amplifications of a dinucleotide microsatellite ((CA/TG)_(n)) for control conditions (upper panel) and with added sorbitol (lower panel).

FIG. 6 shows GeneScan traces from PCR amplifications of a tetranucleotide microsatellite ((CTTT/AAAG)_(n)) for control conditions (upper panel) and with added sorbitol (lower panel).

DETAILED DESCRIPTION

Most of the words used in this specification have the meaning that would be attributed to those words by one skilled in the art. Words specifically defined in the Specification have the meaning provided in the context of the present invention as a whole, and as are typically understood by those skilled in the art. In the event that a conflict arises between an art-understood definition of a word or phrase and a definition of the word or phrase as specifically taught in this specification, the specification shall control. Headings used herein are merely for convenience, and are not to be construed as limiting in any way.

Standard reference works setting forth the general principles of recombinant DNA technology known to those of skill in the art include Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, 1998 Molecular Cloning: A Laboratory Manual (3rd ed.) Sambrook, J. & D. Russell, Eds. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (2001); Kaufman et al., Eds., HANDBOOK OF MOLECULAR AND CELLULAR METHODS IN BIOLOGY AND MEDICINE, CRC Press, Boca Raton, 1995; McPherson, Ed., DIRECTED MUTAGENESIS: A PRACTICAL APPROACH, IRL Press, Oxford, 1991.

As used herein, the term “microsatellite” refers to a genetic locus comprising a short (e.g., 1-5 nucleotide), tandemly repeated sequence motif. As used herein “mononucleotide microsatellite” refers to a genetic locus comprising a repeated nucleotide (e.g., A/T). “Dinucleotide microsatellite” refers to a genetic locus comprising a motif of two nucleotides that is tandemly repeated (e.g., CA/TG, CT/GA). “Trinucleotide microsatellite” refers to a genetic locus comprising motif of three nucleotides that is tandemly repeated (e.g., GAA/TTC). “Tetranucleotide microsatellite” refers to a genetic locus comprising a motif of four nucleotides that is tandemly repeated (e.g., TCTA/TAGA, AGAT/ATCT, AGAA/TTCT, AAAG/CTTT, AATG/CATT, TTTC/GAAA, CTTT/AAAG and GATA/TATC). “Pentanucleotide microsatellite” refers to a genetic locus comprising a motif of five nucleotides that is tandemly repeated (e.g., AAAGA/TCTTT). Microsatellites may contain repeat-motif interspersions, or “cryptically simple sequence” (Tautz, D. et al. (1986) Nature 322(6080):652-656). Such repeat-motif interspersions include simple repeat-motif interspersions wherein the microsatellite contains one or more interspersed repeats with the same length as the tandemly repeated sequence motif, but a different repeat sequence (Eichler, E. E. et al. (1994) Nat. Genet. 8:88-94; Eichler, E. E. et al. (1996) Hum. Mol. Genet. 5:319-330). For example, if the tandemly repeated sequence motif is TCTA, a simple repeat-motif interspersion may appear as follows: TCTA(TCTG)₂(TCTA)₃ (SEQ ID NO:39), wherein the interspersed repeat “TCTG” interrupts the repeat of the TCTA tandemly repeated sequence motif. Repeat-motif interspersions also include more complex repeat-motif interspersions wherein the repeat motif interspersion is not the same length as the tandemly repeated sequence motif. For example, if the tandemly repeated sequence motif is TCTA, the complex repeat-motif interspersion may appear as follows: (TCTA)₃TA(TCTA)₃TCA(TCTA)₂ (SEQ ID NO:40), wherein the tandemly repeated sequence motif is interrupted by TA and TCA. Other more complex repeat motif interspersions include the combination of the simple repeat-motif interspersion and the complex repeat-motif interspersion in the same microsatellite. For example, such a complex sequence repeat-motif interspersion may appear as follows: (TCTA)_(n)(TCTG)_(o)(TCTA)₃TA(TCTA)₃TCA(TCTA)₂TCCATA(TCTA)_(p) (SEQ ID NO:41), wherein both forms of interspersed repeats interrupt the tandemly repeated sequence motif, TCTA. Microsatellites with and without interspersed repeats are encompassed by the term “microsatellites” as used herein.

As used herein, the term “stutter” or “stutter signal” refers to a PCR artifact wherein microsatellites are incorrectly amplified such that a diverse population of fragments of varying length are produced for each allele in the genomic source DNA. A typical “stutter signal” results from one or more PCR products that differ from the appropriate length of the microsatellite-containing fragment by one or more repeat-unit lengths of the microsatellite. As used herein “appropriate length of the microsatellite-containing fragment” refers to the length predicted from the primer sequences and the genomic target sequence, with or without one added nucleotide, depending on whether the PCR conditions promote or suppress the polymerase terminal transferase side reaction. A stutter signal may be perceived with the naked eye, such as by examining a band on an agarose or polyacrylamide gel, or may be perceived with the aid of instrumentation. A stutter signal seen on a gel typically appears as a blurry, shadow band due to microsatellites which are incorrectly amplified such that a diverse population of fragments of varying length are produced. A stutter signal as detected on a GeneScan trace or other electropherogram may appear as a quantified signal, such as a peak on a graph. Stutter signals may be represented by any means, such as, but not limited to brightness, intensity (e.g., maximum intensity), magnitude of a signal output (e.g., peak height, integration of the area of a peak, peak width at half peak height), and the like. For example, in a GeneScan trace for a tetranucleotide microsatellite, a major stutter signal is typically seen as a peak found at four nucleobase units downfield (i.e., at a location corresponding to a shorter fragment) on the electropherogram from the major peak, which represents the allele. Other, less prominent stutter signals may be found at 8 and 12 nucleobase units downfield on the chromatogram. As used herein, the term “reducing stutter” is intended to mean the production of lower amounts of amplified stutter product, as reflected by a decrease in the number or signal intensity of stutter fragments.

As used herein “sorbitol” refers to the polyol (polyhydric alcohol) corresponding to glucose, represented by the following structural formula:

As used herein “betaine” refers to N,N,N-trimethylglycine.

As used herein, the term “isolated nucleic acid molecule” refers to a nucleic acid molecule (DNA or RNA) that has been removed from its native environment.

As used herein, “DNA” refers to deoxyribonucleic acid in its various forms as understood in the art, such as genomic DNA, cDNA, isolated nucleic acid molecules, vector DNA, chromosomal DNA. “Nucleic acid” refers to DNA or RNA in any form. Examples of isolated nucleic acid molecules include, but are not limited to, recombinant DNA molecules contained in a vector, recombinant DNA molecules maintained in a heterologous host cell, partially or substantially purified nucleic acid molecules, and synthetic DNA molecules. Typically, an “isolated” nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., sequences located at the 5′ and 3′ ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. Moreover, an “isolated” nucleic acid molecule, such as a cDNA molecule, is generally substantially free of other cellular material or culture medium when produced by recombinant techniques, or of chemical precursors or other chemicals when chemically synthesized.

As used herein “nucleobase sequence” refers to a sequence of consecutive nucleobases.

As used herein, “anneal” refers to specific interaction between strands of nucleotides wherein the strands bind to one another substantially based on complementarity between the strands as determined by Watson-Crick base pairing. It is not necessary that complementarity be 100% for annealing to occur.

As used herein, “amplifying” refers to enzymatically increasing the amount of a specific nucleotide sequence in a polymerase chain reaction.

As used herein “incubating” refers to a maintaining a state of controlled conditions such as temperature over a period of time.

As used herein “denaturation” refers to the separation of nucleotide strands from an annealed state. Denaturation may be induced by a number of factors including ionic strength of the buffer, temperature, or chemicals that disrupt base pairing interactions.

As used herein “G+C content” refers to the relative amount of guanosine and cytosine present in a given nucleic acid or portion thereof that is of interest, such as a microsatellite. The “G+C content” of a given nucleobase sequence, expressed in percent, can be calculated from the formula 100(#G+#C)/Tot where #G is the number of guanine nucleobases in the nucleobase sequence, #C is the number of cytosine nucleobases in the nucleobase sequence, and Tot is the total number of nucleobases in the nucleobase sequence.

As used herein, “sufficient amount of time” when referring to time for the amplification of nucleic acid, refers to the time which allows the enzyme used to complete the polymerization of deoxynucleotide triphosphates into the amplifying nucleic acid. The amount of time required varies depending on several factors which are well-known by persons of ordinary skill in the art. General principles of PCR and strategies for amplification may be found in such texts as, for example, Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, New York, 2001 and THE POLYMERASE CHAIN REACTION, Mullis, K. B., F. Ferre, and R. A. Gibbs, Eds., Birkhauser, Boston, 1994; AND MOLECULAR CLONING: A LABORATORY MANUAL (3rd ed.) Sambrook, J. & D. Russell, Eds. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (2001).

As used herein “conditions sufficient to amplify microsatellites” refers to reaction conditions for the PCR reactions. The reaction conditions include the chemical components of the reaction and their concentrations, the temperatures used in the reaction cycles, the number of cycles of the reaction, and the durations of the stages of the reaction cycles.

Typically, buffered water is used as the milieu for the reaction. The other chemical components of standard PCR reactions include a DNA polymerase, deoxyribonucleoside triphosphates (“dNTPs”), oligonucleotide primers, divalent metal ion, and a DNA sample expected to contain the PCR target.

The solvent used for PCR typically contain a buffering agent such as Tris-HCl and non-buffering salts such as KCl. The buffering agent may be any known buffers in the art, and may be varied to optimize PCR results by routine experimentation. Persons of ordinary skill in the art will readily be able to determine optimal buffering conditions. Some PCR buffers may be optimized depending on the enzyme used. As an example, but not by way of limitation, AmpliTaq Gold® DNA polymerase has an optimum KCl concentration of 50 mM, AmpliTaq® DNA Polymerase, Stoffel fragment has an optimum KCl concentration of 10 mM, and rTth DNA Polymerase and rTth DNA Polymerase XL, have an optimum KCl concentration of 75-100 mM.

Divalent metal ions are often advantageous to allow the polymerase to function efficiently. For example, but not by way of limitation, magnesium ion allows certain DNA polymerases to function effectively. Typically, MgCl₂ or MgSO₄, is added to reaction buffers to supply the optimum magnesium ion concentration. The magnesium ion concentration required for optimal PCR amplification may depend on the specific set of primers and template used. Thus, the amount of magnesium salt added to achieve optimal amplification is often determined empirically, and is a routine practice in the art. Generally, the concentration of magnesium ion for optimal PCR can vary between 1 and 10 mM. A typical range of magnesium ion concentration in PCR reactions is between 1.0 and 4.0 mM, varying around a midpoint of 2.5 mM.

Deoxynucleotide triphosphates (“dNTPs”), which are the building blocks of the amplifying nucleic acid molecules, are typically supplied in standard PCR reactions at a concentration of 40-200 μM each of deoxyadenosine triphosphate (“dATP”), deoxyguanosine triphosphate (“dGTP”), deoxycytidine triphosphate (“dCTP”) and thymidine triphosphate (“dTTP”). Other dNTPs, such as deoxyuridine triphosphate (“dUTP”), and dNTP analogs, and conjugated dNTPs may also be used, and are encompassed by the term “dNTPs” as used herein. While use of dNTPs at such concentrations is amenable to the methods of the invention, concentrations of dNTPs higher than 200 μM may be advantageous. Thus, in some embodiments of the methods of the invention, the concentration of each dNTP is generally at least 500 μM and may range up to 2 mM. In some further embodiments, concentration of each dNTP may range from 0.5 mM to 1 mM.

The enzyme that polymerizes the nucleotide triphosphates into the amplified fragments of the PCR may be any DNA polymerase, including heat-resistant polymerases known in the art. Polymerases that may be used in the invention include, but are not limited to DNA polymerases from such organisms as Thermus aquaticus, Thermus thermophilus, Thermococcus litoralis, Bacillus stearothermophilus, Thermotoga maritima and Pyrococcus ssp. The enzyme may be isolated from the source bacteria, produced by recombinant DNA technology or purchased from commercial sources. For example, DNA polymerases are available from Applied Biosystems and include AmpliTaq Gold® DNA polymerase; AmpliTaq® DNA Polymerase; AmpliTaq® DNA Polymerase, Stoffel fragment; rTth DNA Polymerase; and rTth DNA Polymerase XL. Other suitable polymerases include, but are not limited to Tne, Bst DNA polymerase large fragment from Bacillus stearothermophilus, Vent and Vent Exo— from Thermococcus litoralis, Tma from Thermotoga maritima, Deep Vent and Deep Vent Exo— and Pfu from Pyrococcus, and mutants, variants and derivatives of the foregoing.

Oligonucleotide primers are added to the reaction and demarcate the 5′ and 3′ ends of the amplified fragment. One oligonucleotide primer anneals to the sense (+strand) of the denatured, template DNA, and the other oligonucleotide primer anneals to the antisense (−strand) of the denatured, template DNA. Typically, oligonucleotide primers are 12-25 nucleotides in length, however, they may be shorter or longer depending on the specific template sequence to be amplified, and the length of the primer is not essential to the operation of the invention. Oligonucleotide primers may be designed to anneal to specific portions of DNA that flank a microsatellite of interest to specifically amplify the portion of DNA between the primer-complementary sites. Generally, oligonucleotide primers are chemically synthesized. One of ordinary skill in the art may easily design specific primers to amplify a target microsatellite of interest. Furthermore, there are many known primer sequences to amplify microsatellite regions. Any of these may be used, and are within the scope of the invention.

The oligonucleotide primers may be composed of adenosine, thymidine, guanosine, cytidine, uracil, nucleoside analogs (e.g., locked nucleic acids (LNA), peptide nucleic acid (PNA), phosphoramidites) and nucleosides containing or conjugated to chemical moieties such as radionuclides (e.g., ³²P, ³⁵S), fluorescent molecules, minor groove binders, or any other nucleoside conjugate known in the art.

In some embodiments of the invention, a fluorophore is used to tag at least one primer of the PCR reaction. In some embodiments primers for different target fragments can be tagged with different fluorophores (that produce differently colored products) and may be used in the same multiplex PCR reaction and subsequently analyzed together. Typically, the forward primer is tagged, but the reverse primer may also be tagged. Examples of fluorophores include, but are not limited to, fluorescein (which absorbs maximally at 492 nm and emits maximally at 520 nm); TAMRA, N,N,N′,N′-tetramethyl-6-carboxyrhodamine (which absorbs maximally at 555 nm and emits maximally at 580 nm); FAM, 5-carboxyfluorescein (which absorbs maximally at 495 nm and emits maximally at 525 nm); JOE, 2′,7′-dimethoxy-4′,5′-dichloro-6-carboxyfluorescein (which absorbs maximally at 525 nm and emits maximally at 555 nm), ROX, 6-carboxy-X-rhodamine (which absorbs maximally at 585 nm and emits maximally at 605 nm); CY3 (which absorbs maximally at 552 nm and emits maximally at 570 nm), CY5 (which absorbs maximally at 643 nm and emits maximally at 667 nm); TET, tetrachloro-fluorescein (which absorbs maximally at 521 nm and emits maximally at 536 nm); and HEX, hexachloro-fluorescein (which absorbs maximally at 535 nm and emits maximally at 556 nm).

Other known components of PCR reactions may be used within the scope of the invention. Such components include, but are not limited to, detergents (e.g., Triton X-100, Nonidet P-40 (NP-40), Tween-20) and agents that disrupt mismatching of nucleotide pairs, such as dimethylsulfoxide (DMSO), and tetramethylammonium chloride (TMAC).

As used herein the term “additive” when referring to a PCR reaction, refers to sorbitol, betaine and mixtures thereof.

PCR reaction times, temperatures and cycle numbers may be varied to optimize a particular reaction as a matter of routine experimentation. Those of ordinary skill in the art will recognize the following as guidance in determining the various parameters for PCR reactions, and also will recognize that variation of one or more conditions is within the scope of the invention.

PCR reaction temperature and time are determined in three stages: denaturation, annealing and extension. One round of denaturation, annealing and extension is referred to as a “cycle.” Denaturation is generally conducted at a temperature that permits the strands of DNA to separate, yet not destroy polymerase activity. Generally, thermoresistant polymerases are used. However, heat-labile polymerases may be used if they are replenished after each denaturation step of the PCR. Thermoresistant polymerases can withstand high temperatures and maintain some level of activity. Typically, denaturation is conducted above 90° C. and below 100° C. In some embodiments, denaturation is conducted at a temperature of 94-95° C. Denaturation of DNA is generally conducted for at least 1 to 30 seconds. In some embodiments, denaturation is conducted for 1 to 15 seconds. In other embodiments, denaturation is conducted for up to 1 minute or more. In addition to the denaturation of DNA, for some polymerases, such as AmpliTaq Gold®, incubation at the denaturation temperature also serves to activate the enzyme. Therefore, it may be advantageous to allow the first step of PCR (denaturation) to be longer than subsequent denaturation steps when these enzymes are used.

During the annealing phase, oligonucleotide primers anneal to the target DNA in their regions of complementarity and are substantially extended by the DNA polymerase once the latter has bound to the primer-template duplex.

In a conventional PCR, the annealing temperature typically is at or below the melting point (T_(m)) of the least stable primer-template duplex, where T_(m) can be estimated by any of several theoretical methods well known to practitioners of the art. For example, the T_(m) may be determined by the formula: T _(m)=(4° C.×number of G and C bases)+(2° C.×number of A and T bases) Typically, in standard PCRs, the annealing temperature is 5° C. to 10° C. below the estimated T_(m) of the least stable primer-template duplex. The annealing time is between about 30 seconds and 2 minutes. However, in certain embodiments of the methods of the invention, the high concentration of betaine and/or sorbitol increases reagent viscosity and appears to slow certain steps of the reaction (e.g., primer annealing and polymerase binding to the primer-template duplex). Thus, in certain embodiments of the methods of the invention, the annealing step is performed for a longer period of time than would be used in standard PCR protocols, typically for at least 3 minutes and as long as 5 to 6 minutes.

Sorbitol and betaine not only increase reaction viscosity, but also are mild DNA denaturants. Thus, in certain embodiments of the methods of the invention, it is may be advantageous to use a lower temperature for annealing primers to the template than would be used by one of ordinary skill in the art for standard PCR reactions. In general, temperatures lower than 10° C. below the T_(m) (estimated in the absence of additive) may be employed in certain embodiments of the invention.

The annealing phase typically is followed by an extension phase. “Extension” is conducted for a sufficient amount of time to allow the enzyme to complete primer extension into the appropriately sized fragments. As discussed above, the addition of high concentrations of sorbitol and/or betaine increases the viscosity of the reaction, making unconventionally long extension times advantageous in certain embodiments of the methods of the invention; i.e., the use of extension times that are longer compared to extension times one of ordinary skill in the art would calculate for standard PCR reactions. Furthermore, as noted above for the annealing phase, sorbitol and betaine are mild denaturants. Thus, in some embodiments of the methods of the invention, it may be advantageous to also use a lower temperature for extension than would be used by one of ordinary skill in the art for standard PCR reactions. Thus, for some embodiments, temperatures for extension are below the temperature reported for optimal activity of the polymerases used.

The number of cycles of PCR (denaturation, annealing and extension) used will determine the desired amount of amplification. PCR is an exponential amplification of DNA molecules. Thus, theoretically, after each cycle of PCR, there are twice the number of fragments that were present in the previous cycle. Typically, 20-30 cycles of PCR are performed. More typically, 25-30 cycles are performed, although cycle number is not particularly limited.

For some embodiments, it is advantageous to incubate the reactions at a certain temperature following the last phase of the last cycle of PCR. In some embodiments, a prolonged extension phase is selected. In other embodiments, an incubation at a low temperature (e.g., 4° C.) is selected.

In one embodiment of the invention, methods are provided for reducing stutter in the amplification of a microsatellite wherein a sample containing a microsatellite of interest is provided, wherein the microsatellite has a G+C content of 50% or less. The sample is contacted with at least one enzyme having nucleic acid polymerase activity, and the sample is incubated with the enzyme for a sufficient amount of time and under conditions sufficient to amplify said microsatellite. The incubation is performed in the presence of an amount of betaine and/or sorbitol that is effective to reduce stutter relative to the amount of stutter observed in the absence of betaine and/or sorbitol. The PCR reaction includes primers that are selected to amplify the target microsatellite of interest and which are optionally tagged, dNTPs, buffer, the sample containing the target nucleic acid to be amplified, betaine and/or sorbitol and the polymerase.

In another embodiment of the invention, primer extension reactions may be performed with greater accuracy when conducted in the presence of sorbitol and/or betaine. In primer extension reactions, an oligonucleotide primer is permitted to bind to a specific target sequence on a nucleic acid molecule in the presence of dNTPs and a polymerase. The reaction is incubated and the polymerase polymerizes the dNTPs and extends the oligonucleotide primer in the 5′ to 3′ direction to form the complement of the target nucleic acid molecule. The primer may contain a detectable label, or the dNTPs used may contain a detectable label. Further, the dNTPs may be modified dNTPs, including, such as, but not limited to, dideoxynucleotide triphosphates.

The microsatellites amplified in the methods of the invention may be mononucleotide microsatellites, dinucleotide microsatellites, trinucleotide microsatellites, tetranucleotide microsatellites and/or pentanucleotide microsatellites, and the microsatellites may include repeat motif interspersions. The mononucleotide microsatellites of the invention comprise repeats of A, such as have been noted in some tumor markers where microsatellite instability has been described. The complementary strand of this portion of DNA would contain repeats of T. To denote the microsatellite with its complementary strand, the notation A/T is used. In some embodiments, the microsatellite is a dinucleotide microsatellite. Examples of dinucleotide microsatellites include, but are not limited to CA/TG, and CT/AG. In other embodiments, the microsatellite is a trinucleotide microsatellite, including but not limited to GAA/TTC. In still other embodiments, the microsatellite is a tetranucleotide microsatellite, including, but not limited to TCTA/TAGA, AGAA/TTCT, AAGG/CCTT, AATG/CATT, TTTC/GAAA, TCTG/CAGA, and TAGG/CCTA. In other embodiments, the microsatellite is a pentanucleotide microsatellite, including but not limited to AAAGA/TCTTT. Examples of loci that contain such tetranucleotide microsatellites include, but are not limited to D3S1358 ((TCTA)(TCTG)₁₋₃(TCTA)_(n)) (SEQ ID NOS: 42-44); VWA (TCTA(TCTG)₃₋₄(TCTA)_(n)) (SEQ ID NOS: 44 and 46); D16S539 ((AGAT)_(n)); D8S1179((TCTR)_(n)) (wherein R is purine); D21S11((TCTA)_(m)(TCTG)_(n)(TCTA)₃TA(TCTA)₃TCA(TCTA)₂TCCATA(TCTA)_(o)) (SEQ ID NO: 41); D18S51((AGAA)_(n)); D19S433((AAGG)(AAAG)(AAGG)(TAGG)(AAGG)_(n)) (SEQ ID NO: 47); TH01((AATG_(n)); FGA ((TTTC)₃TTTTTCT(CTTT)_(n)CTCC(TTCC)₂) (SEQ ID NO: 48); D7S820((GATA)_(n)); D13S317((GATA)_(n)); D5S818((AGAT)_(n)); CSF1PO ((AGAT)_(n)); and TPOX ((AATG)_(n)).

The microsatellites amplified in the method of the invention generally have a G+C content of 50% or less. In some embodiments, the microsatellite has a G+C content of 40% or less. In other embodiments, the microsatellite has a G+C content of 30% or less. In other embodiments, the microsatellite has a G+C content of 20% or less.

The amount of sorbitol and/or betaine added to the PCR reaction is generally in an amount effective to reduce stutter relative to the amount of stutter observed in the absence of sorbitol and/or betaine. In some embodiments, the amount of sorbitol and/or betaine added to the PCR reaction is generally 1.5 to 3.5 M. In some embodiments, sorbitol and/or betaine is added in an amount of 2.0 to 3.0 M. In other embodiments, sorbitol and/or betaine is added in an amount of 2.0 M. The sorbitol and/or betaine may be added from a separate stock or may be added as part of another PCR reagent. For example, in one embodiment of the invention, the sorbitol and/or betaine is included in the DNA polymerase preparation and is added when the enzyme is added to the reaction. In another embodiment of the invention, the sorbitol and/or betaine is included with a preparation of magnesium ion containing reagent (e.g., MgCl₂ or MgSO₄) so that the sorbitol and/or betaine is added with the MgCl₂ or MgSO₄.

Sorbitol and betaine may be included together in the same reaction. When sorbitol and betaine are used together, the concentration of the total amount of additive (i.e., sorbitol and betaine) is generally 1.5 to 3.5 M. In some embodiments, the amount of additive is 2.0 to 3.0 M. In other embodiments the amount of additive is 2.0 M. The additive may be in any combination of sorbitol with betaine, from 100% sorbitol to 100% betaine. For example, but not by way of limitation, the additive may be 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, or 5% betaine, the remainder being sorbitol; or 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50%, 45%, 40%, 35%, 30%, 25%, 20%, 15%, 10%, or 5% sorbitol, the remainder being betaine.

Sorbitol and betaine have different chemical properties which may be exploited for various purposes. For example, sorbitol is more hydrolytically inert than betaine, which may impart a longer shelf-life to the PCR reagents. As another example, betaine has a carboxylate group which may affect the pH of the PCR reaction when betaine is added in high concentrations, whereas sorbitol possesses no acid-base properties.

In accordance with some embodiments of the invention, betaine and/or sorbitol is employed in PCR protocols using concentrations of dNTPs that are higher than those currently employed in standard PCR protocols. Thus, in some embodiments of the methods of the invention, PCR is employed to amplify one or more mono- di-, tri-, tetra-, or pentanucleotide microsatellites wherein the PCR reaction mixture includes concentrations of dNTPs from 0.5 mM to 2.0 mM, and concentrations of sorbitol, betaine, or mixtures thereof, at a concentration of 1.5 to 3.5 M. In further embodiments of the methods of the invention, PCR is employed to amplify one or more mono-, di-, tri-, tetra- or pentanucleotide microsatellites wherein the PCR reaction mixture includes concentrations of dNTPs from 1.0 mM to 2.0 mM, and concentrations of sorbitol, betaine, or mixtures thereof, at a concentration of 2.0 to 3.0 M.

In accordance with the embodiments of the methods of the invention, the addition of betaine and/or sorbitol to the reactions is effective in reducing stutter to between 90% and 20% of the amount of stutter obtained in the absence of betaine and/or sorbitol. In some embodiments, the amount of stutter observed with the addition of betaine and/or sorbitol is reduced to 90% or less than the amount observed in the absence of betaine and/or sorbitol. In some embodiments, the amount of stutter observed with the addition of betaine and/or sorbitol is reduced to 80% or less than the amount observed without the addition of betaine and/or sorbitol. In other embodiments, the amount of stutter observed with the addition of betaine and/or sorbitol is reduced to 70% or less than the amount observed without the addition of betaine and/or sorbitol. In other embodiments, the amount of stutter observed with the addition of betaine and/or sorbitol is reduced to 60% or less than the amount observed without the addition of betaine and/or sorbitol. In other embodiments, the amount of stutter observed with the addition of betaine and/or sorbitol is reduced to 50% or less than the amount observed without the addition of betaine and/or sorbitol. In other embodiments, the amount of stutter observed with the addition of betaine and/or sorbitol is reduced to 40% or less than the amount observed without the addition of betaine and/or sorbitol. In other embodiments, the amount of stutter observed with the addition of betaine and/or sorbitol is reduced to 30% or less than the amount observed without the addition of betaine and/or sorbitol. In other embodiments, the amount of stutter observed with the addition of betaine and/or sorbitol is reduced to 20% or less than the amount observed without the addition of betaine and/or sorbitol. The reduction of stutter is measured by determining the percent stutter in the presence of additive (+A), and in the absence of additive (−A), and performing the following calculation (+A)/(−A)×100. Percent stutter is determined, for example, by the peak height of the stutter signal over the peak height of the allele signal times 100. Alternatively, percent stutter is determined by peak width at half peak height of the stutter signal over the peak width at half peak height of the allele signal times 100. In other embodiments, percent stutter is determined by area of the peak of the stutter signal over area of the peak of the allele signal times 100. Any other method of determining percent stutter in the presence of the additive (i.e., sorbitol and/or betaine) relative to the percent stutter in the absence of the additive may be used, and the percent reduction determined.

In further embodiments of the invention, methods are provided in which a sample containing nucleic acid that is suspected of containing one or more microsatellites (e.g., mononucleotide microsatellites, dinucleotide microsatellites, trinucleotide microsatellites, tetranucleotide and/or pentanucleotide microsatellites having a G+C content of 50% or less) is contacted with an enzyme that polymerizes nucleotides in the presence of an amount of betaine and/or sorbitol to effective reduce observed stutter relative to the amount of stutter observed in the absence of betaine and/or sorbitol, and amplifying at least one nucleobase sequence containing at least one microsatellite of the nucleic acid contained in the sample.

In further embodiments of the invention, PCR reactions as described herein are employed to amplify fragments from at least one microsatellite region. In certain embodiments, the fragments are amplified with a detectable tag (e.g., a fluorophore-tagged primer) or with a hybridization enhancer, e.g., a minor groove binder. Where more than one microsatellite region is to be amplified, detectable tags are selected such that different products are easily distinguished. As an example, but not by way of limitation, different colored fluorophores may be used to amplify different microsatellites. Furthermore, the same color fluorophore may be used to amplify fragments containing microsatellites that generate fragments of different sizes which are readily discernable, e.g., by electrophoretic separation. The PCR products can be analyzed in on a sieving or non-sieving medium. In some embodiments of the invention, for example, the PCR products are analyzed by capillary electrophoresis as described in Wenz, H. et al. (1998) Genome Res. 8:69-80. In other embodiments of the invention, for example, the PCR products are analyzed by slab gel electrophoresis as described in Christensen, M. et al. (1999) Scand. J. Clin. Lab. Invest. 59(3):167-177. Fragments may be analyzed by chromatography (e.g., size exclusion chromatography (SEC)).

In certain embodiments, the methods of the invention decrease the number or intensity of stutter bands, or shadow bands, on standard gels. Thus, the methods of the invention provide for more simplified and more accurate interpretation of banding patterns on gels, with increased resolution of bands. Accordingly, for some embodiments of the invention, the PCR reactions may be analyzed on agarose or polyacrylamide gels using standard techniques.

In accordance with the embodiments of the invention, the use of sorbitol or betaine in the analysis of microsatellites by PCR provides a marked enhancement in the diagnostic capabilities of microsatellites in mono-, di-, tri-, tetra- and pentanucleotide-repeat microsatellites.

In some embodiments, the invention provides kits. In some embodiments, the kits of the invention contain a concentrated stock solution of sorbitol and/or betaine for addition to the PCR reactions such that sorbitol and/or betaine is diluted to be present in an amount of 1.5 to 3.5 M in the PCR. Typically, the sorbitol and/or betaine is diluted to be present in an amount of 2 to 3 M. In addition to the sorbitol and/or betaine solution, the kits may contain at least one of the following reagents: dNTPs (either separately or as a mixture), DNA polymerase, buffer, and primers to amplify microsatellites. The kits may also contain conventional kit components, such as instructions for use.

Microsatellite markers that exhibit a high degree of polymorphism are exploited in many important applications. Recently, the A-repeat BAT loci have been used in studying microsatellite instability (MSI) which correlates with certain cancers (Chen et al. (1995) Cancer Res. 55:174-180; Ionov et al. (1993) Nature 363:558-561 De La Chapelle (1999) Eur. J. Hum. Genet. 7:407-408). However, due to the small allelic size differences in these markers and stutter during PCR, it has been difficult to determine whether the DNA samples are homozygous or heterozygous. Additionally, polymorphic variations at these loci in individuals of different ethnic origins supports the need to define the different allelic profiles and frequencies (Pyatt et al. (1999) Am. J. Pathol. 155:349-353).

Detection of genetic disorders associated with aberrant microsatellites is an application of the methods of the invention. Samples may be taken from tissues or individuals suspected of harboring aberrant microsatellites in their DNA and the DNA may be amplified by PCR in the presence of an sufficient amount of betaine and/or sorbitol under conditions sufficient to reduce stutter relative to observed stutter in the absence of betaine and/or sorbitol for microsatellites with a G+C content of 50% or less. The resulting PCR products may be compared to PCR products from normal tissue, or tissue from normal individuals, and variation assessed. Aberrant microsatellites may indicate a propensity to develop a genetically-based disorder, or may indicate the presence of a genetically-based disease. Such disorders include, but are not limited to cancer, pre-cancerous conditions (e.g., colorectal and breast cancer), and Friedreich's ataxia. As an example, but not by way of limitation, colorectal cancer may be diagnosed using the method of the invention by amplification of BAT-25 and BAT-26 poly-A repeat microsatellites in the 50C10, 52H10 and/or apoD genetic locus in the presence of a sufficient amount of sorbitol and/or betaine. As another non-limiting example, Friedreich's ataxia may be detected by amplifying the GAA/TTC microsatellite region of the frataxin gene.

In a specific embodiment, PCR amplification reactions may be set up to amplify a mononucleotide repeat using the BAT-25 or BAT-26 markers. BAT-26 and BAT-25 once were thought to exist as single alleles in the genomes of most people. BAT-26 once was thought to contain a region of 26 repeated A bases in 95% of the population (de la Chapelle (1999) Eur. J. Hum. Genet. 7:407-408), and polymorphisms in this region has been associated with colorectal tumors. Application of the present invention shows that these and other BAT loci are more polymorphic than originally thought and allows accurate determination of their true allelic size range. The PCR reactions may be set up such that one or more loci may be examined. Useful loci are 50C10, 52H10 and apoD. Sense and antisense primers for each of these loci are: 50C10 sense, 5′-cca aag gtt atg ccg agg t-3′ (SEQ ID NO: 1); 50C10 antisense, 5′-cgt tca tgc gtc tgg get t-3′ (SEQ ID NO: 2); 52H10 sense, 5′-ccc taa ctg tct cta taa aag a-3′ (SEQ ID NO: 3); 52H10 antisense, 5′-ccc aat cta tct aac aca ttg t-3′ (SEQ ID NO: 4); apoD sense, 5′-cat gtt gca aca cgt cct get-3′ (SEQ ID NO: 5); apoD antisense, 5′-ggc taa gtg aag cat gag gt-3′ (SEQ ID NO: 6); BAT-10, 5′-FAM gat aat ata gca tta taa cac tg-3′ (SEQ ID NO: 7) and 5′-gaa cac aaa gga agt gtc tg-3′ (SEQ ID NO: 8) (Parsons et al. (1995) Cancer Res. 55:5548-5550); BAT-16, 5′-FAM tcc act gtg tct tta tta gg-3′ (SEQ ID NO: 9) and 5′-aaa ccg tac tct tca cac ac-3′ (SEQ ID NO: 10) (Zhou et al. (1997) Oncogene 15:1713-1718); BAT-25, 5′FAM tcgcct cca aga atg taa gt-3′ (SEQ ID NO: 11) and 5′tct gca ttt taa cta tgg ctc-3′ (SEQ ID NO: 12) (Parsons et al. (1995) Cancer Res. 55:5548-5550); BAT-26, 5′FAM tga cta ctt ttg act tct tca gcc-3′ (SEQ ID NO: 13) and 5′-aac cat tca aca ttt tta acc-3′ (SEQ ID NO: 14) (Parsons et al. (1995) Cancer Res. 55:5548-5550); BAT-34C4 F, 5′-FAM accctg gag gat ttc atc tc-3′ (SEQ ID NO: 15) and 5′-aac aaa gcg aga ccc agt ct-3′ (SEQ ID NO: 16) (Zhou et al. (1997) Oncogene 15:1713-1718). The sense primers of each of these loci may be tagged with a different fluorophore to readily distinguish the PCR products. The PCR reactions may be set up as follows: 50 mM KCl, 10 mM Tris-HCl (pH=9.0), 0.1% Triton X-100, 4.5 mM MgCl₂, 1.0 mM each of dATP, dGTP, dCTP, and dTTP, 2.5 U Taq DNA polymerase and 2 M sorbitol. The reaction may then proceed as follows: 94° C. for 1 minutes, and 28 cycles of 94° C. for 15 seconds, 40° C. for 3 minutes and 65° C. for 5 minutes.

PCR reactions are then analyzed by denaturing samples and separating using a capillary electrophoresis protocol in an ABI PRISM® 310 genetic analyzer, or by separating on a 4.5%, 29:1 acrylamide:bis acrylamide, 8 M urea gel prepared for an ABI 377 Automated Fluorescence DNA Sequencer. Sequence data may be analyzed with GeneScan Software from Applied Biosystems. Alteration in the size of the amplified fragment as compared to normal, control tissues or samples could be indicative of cancer or a predisposition to developing cancer.

The methods of the invention are also useful in such applications as genetic mapping (linkage analysis). Linkage analysis may be accomplished, for example, by using a panel of primers to amplify a set of loci containing microsatellites that have a G+C content of 50% or less in the presence of a sufficient amount of betaine and/or sorbitol to reduce the observation of stutter from that observed in the absence of betaine and/or sorbitol. Genetic loci such as D3S1358; VWA; D16S539; D8S1179; D21S11; D18S51; D19S433; TH01; FGA; D7S820; D13S317; D5S818; CSF1PO; TPOX; hypoxanthine phosphoribosyltransferase (primers: 5′-atg cca cag ata ata cac atc ccc-3′ (SEQ ID NO: 17) and 5′-ctc tcc aga ata gtt aga tgt agg-3′ (SEQ ID NO: 18)); intestinal fatty acid-binding protein (primers: 5′-gta gta tca gtt tca tag ggt cac c-3′ (SEQ ID NO: 19) and 5′-cag ttc gtt tcc att gtc tgt ccg-3′ (SEQ ID NO: 20)); recognition/surface antigen (primers: 5′-ttg gag tcg caa gct gaa cta gcg-3′ (SEQ ID NO: 21) and 5′-cca gga agt tga ggc tgc agt gaa-3′ (SEQ ID NO: 22)); c-fms proto-oncogene for CFS-1 receptor (primers: aac ctg agt ctg cca agg act agc-3′ (SEQ ID NO: 23) and 5′-ttc cac aca cca ctg gcc atc ttc-3′ (SEQ ID NO: 24)); tyrosine hydroxylase (primers: gtg ggc tga aaa gct ccc gat tat-3′ (SEQ ID NO: 25) and 5′-att caa agg gta tct ggg ctc tgg-3′ (SEQ ID NO: 26)); pancreatic phospholipase A-2 (primers: 5′-ggt tgt aag ctc cat gag gtt aga-3′ (SEQ ID NO: 27) and 5′-ttg agc act tac tat gtg cca ggc t-3′ (SEQ ID NO: 28)); coagulation factor XIII (primers: 5′-gag gtt gca ctc gag cct ttg caa-3′ (SEQ ID NO: 29) and 5′-tcc ctg aat cat ccc aga gcc aca-3′ (SEQ ID NO: 30)); aromatase cytochrome P-450 (primers: 5′-ggt aag cag gta ctt agt tag cta a-3′ (SEQ ID NO: 31) and 5′-gtt aca gtg agc caa ggt cgt gag-3′ (SEQ ID NO: 32)); lipoprotein lipase (primers: 5′-ctg acc aag gat agt ggg ata tag-3′ (SEQ ID NO: 33) and 5′-ggt aac tga gcg aga ctg tgt ct-3′ (SEQ ID NO: 34)); c-fes/fps proto-oncogene (primers: 5′-gct tgt taa ttc atg tag gga agg c-3′ (SEQ ID NO: 35) and 5′-gta gtc cca gct act tgg cta ctc-3′ (SEQ ID NO: 36)); and unknown fragment (primers 5′-aga ggt tac agt gag ccg aga ttg-3′ (SEQ ID NO: 37) and 5′-gaa gtc cta aca gaa tgg aag gtc c-3′ (SEQ ID NO: 38)) may be amplified for a given sample of DNA from an individual. The amplification may be done in the same reaction if the primers are differentially labeled (such as by using fluorophore tags) that will allow ready identification of PCR products following amplification. After genotyping based on the electrophoretic pattern obtained, genetic mapping is completed by statistical correlation of microsatellite allele frequencies with a phenotypic trait among genetically related family members.

In the field of human identity, tetranucleotide microsatellites are used in forensic casework, establishment of convicted felon databases, disaster and military victim identification (Fre'geau et al. (1993) Biotechniques 15:100-119). Furthermore, they have proved useful in forensics to identify human remains (Hagelberg et al. (1991) Nature 352:427-429; Hammond et al. (1994) Am. J. Hum. Genet. 55:175-189). In the analysis of museum specimens (Ellegren et al. (1991) Nature 354:113) and in parentage testing. Tetranucleotide microsatellites are specifically powerful in these applications, since multiple microsatellite tests that have matching probabilities of one in several billion individuals are now available. Examples of microsatellite containing alleles which can be used for paternity, forensic and other personal identification are D3S1358; VWA; D16S539; D8S1179; D21S11; D18S51; D19S433; TH01; FGA; D7S820; D13S317; D5S818; CSF1PO; TPOX; hypoxanthine phosphoribosyltransferase (primers: 5′-atg cca cag ata ata cac atc ccc-3′ (SEQ ID NO:17) and 5′-ctc tcc aga ata gtt aga tgt agg-3′ (SEQ ID NO:18)); intestinal fatty acid-binding protein (primers: 5′-gta gta tca gtt tca tag ggt cac c-3′ (SEQ ID NO:19) and 5′-cag ttc gtt tcc att gtc tgt ccg-3′ (SEQ ID NO:20)); recognition/surface antigen (primers: 5′-ttg gag tcg caa gct gaa cta gcg-3′ (SEQ ID NO:21) and 5′-cca gga agt tga ggc tgc agt gaa-3′ (SEQ ID NO:22)); c-fms proto-oncogene for CFS-1 receptor (primers: aac ctg agt ctg cca agg act agc-3′ (SEQ ID NO:23) and 5′-ttc cac aca cca ctg gcc atc ttc-3′ (SEQ ID NO:24)); tyrosine hydroxylase (primers: gtg ggc tga aaa gct ccc gat tat-3′ (SEQ ID NO:25) and 5′-att caa agg gta tct ggg ctc tgg-3′ (SEQ ID NO:26)); pancreatic phospholipase A-2 (primers: 5′-ggt tgt aag ctc cat gag gtt aga-3′ (SEQ ID NO:27) and 5′-ttg agc act tac tat gtg cca ggc t-3′ (SEQ ID NO:28)); coagulation factor XIII (primers: 5′-gag gtt gca ctc gag cct ttg caa-3′ (SEQ ID NO:29) and 5′-tcc ctg aat cat ccc aga gcc aca-3′ (SEQ ID NO:30)); aromatase cytochrome P-450 (primers: 5′-ggt aag cag gta ctt agt tag cta a-3′ (SEQ ID NO:31) and 5′-gtt aca gtg agc caa ggt cgt gag-3′ (SEQ ID NO:32)); lipoprotein lipase (primers: 5′-ctg acc aag gat agt ggg ata tag-3′ (SEQ ID NO:33) and 5′-ggt aac tga gcg aga ctg tgt ct-3′ (SEQ ID NO:34)); c-fes/fps proto-oncogene (primers: 5′-gct tgt taa ttc atg tag gga agg c-3′ (SEQ ID NO:35) and 5′-gta gtc cca gct act tgg cta ctc-3′ (SEQ ID NO:36)); and unknown fragment (primers 5′-aga ggt tac agt gag ccg aga ttg-3′ (SEQ ID NO:37) and 5′-gaa gtc cta aca gaa tgg aag gtc c-3′ (SEQ ID NO:38)). Genotyping methods used for human identification may also be applied to plant and animal breeding, using appropriate genetic loci.

Table 1 provides additional genetic loci with tetranucleotide microsatellites having a G+C content of 50% or less which are useful in personal identification, particularly forensic analysis.

TABLE 1 Tetranucleotide Microsatellites Locus Designation Common Sequence Motif D3S1358 TCTA(TCTG)₁₋₃(TCTA)_(n) (SEQ ID NOS: 42-44) vWA TCTA(TCTG)₃₋₄(TCTA)_(n) (SEQ ID NOS: 42 and 44) D16S539 (AGAT)_(n) APOA/1 (AAAG)_(n) D8S1179 (TCTR)_(n) D21S11 (TCTA)_(m)(TCTG)_(n)(TCTA)₃TA(TCTA)₃TCA(TCTA)₂TCCATA(TCTA)_(o) (SEQ ID NO: 41)  D18S51 (AGAA)_(n) ACTBP2 (AAAG)_(n) TH01 (AATG)_(n) FGA (TTTC)₃TTTTTCT(CTTT)_(n)CTCC(TTCC)₂ (SEQ ID NO: 48) D7S820 (GATA)_(n) D13S317 (GATA)_(n) D5S818 (AGAT)_(n) CSF1PO (AGAT)_(n) TPOX (AATG)_(n) CD4 (TTTTC)_(n) CYAR04 (AAAT)_(n) F13A01 (GAAA)_(n) F13B (TTTA)_(n) FABP (ATT)_(n) FES/FPS (ATTT)_(n) HPRTB (TCTA)_(n) LPL (TTTA)_(n) Penta D (AAAGA)_(n) Penta E (AAAGA)_(n) PLA2A1 (AAT)_(n) D1S1656 (TAGA)(TAGG)_(n) D2S1242 (GAAA)(GAAG)_(n) D3S1359 (TCTA)_(n) D3S1744 (GATA)_(n) D6S477 (TCTA)_(n) D8S347 (AGAT)_(n) D8S639 (AGAT)(AGGT)_(n) D9S302 (ATCT)_(n) D10S2325 (TCTTA)_(n) D11S488 (AAAG)(GAAG)_(n) D11S554 (AAAG)_(n) D12S391 (AGAT)(AGAC)_(n) D12S1090 (GATA)_(n) D18S535 (GATA)_(n) D18S849 (GATA)_(n) D20S161 (TAGA)_(n) D22S683 (TA)(TATC)_(n) DXS6807 (GATA)_(n) D19S433 (AAGG)(AAAG)(AAGG)(TAGG)(AATG)_(n) (SEQ ID NO: 47) D10S2325 (TCTTA)_(n)

Personal identification tests may be performed on any specimen that contains nucleic acid such as bone, hair, blood, tissue and the like. DNA may be extracted from the specimen and a panel of primers to amplify a set of microsatellites used to amplify DNA in the presence of an effective amount of sorbitol and/or betaine to reduce stutter from the specimen to generate a set of amplified fragments. In forensic testing, the specimen's microsatellite amplification pattern is compared with a known sample the presumptive victim (the presumed matching source) or is compared to the pattern of amplified microsatellites derived from the presumptive victim's family members (e.g., the mother and father) wherein the same set of microsatellites is amplified in the presence of an effective amount of betaine and/or sorbitol to reduce stutter using the same primers. The pattern of microsatellite amplification may be used to confirm or rule out the identity of the victim. In paternity testing, the specimen is generally from the child and the comparison is made to the microsatellite pattern from the presumptive father, and may include matching with the microsatellite pattern from the child's mother. The pattern of microsatellite amplification may be used to confirm or rule out the identity of the father. The panel may include microsatellites with a G+C content of 50% or less such as, for example, D3S1358; vWA; D16S539; D8S1179; D21S11; D18S51; D19S433; TH01; FGA; D7S820; D13S317; D5S818; CSF1PO; TPOX; hypoxanthine phosphoribosyltransferase; intestinal fatty acid-binding protein; recognition/surface antigen; c-fms proto-oncogene for CFS-1 receptor; tyrosine hydroxylase; pancreatic phospholipase A-2; coagulation factor XIII; aromatase cytochrome P-450; lipoprotein lipase; c-fes/fps proto-oncogene; and unknown fragment. PCR conditions include 5-10 ng genomic DNA, 10 pmoles each fluorophore-tagged primer, 2.5 M sorbitol, 1 mM each dNTP, 5 mM MgCl₂, 50 mM KCl, 10 mM Tris-HCl, 5 U DNA polymerase. PCR cycle conditions may be 1 min 94° C., followed by 30 cycles of 20 seconds at 94° C., 3 minutes at 50° C., 3 minutes at 60° C., followed by one cycle of 10 minutes at 60° C. The products are examined by capillary electrophoresis coupled with GeneScan 310 analysis.

Dinucleotide microsatellites are also used in paternity testing for cattle, dogs, horses and other animals (Primmer et al. (1995) Mol. Ecol. 4:493-498). In a clinical setting, microsatellite markers can be used to monitor the degree of donor engraftment in bone marrow transplants. In hospitals, microsatellite markers are useful in specimen matching tracking. More recently, microsatellite markers have also entered other fields of science such as population biology studies on human racial and ethnic group differences (Goldstein et al. (1995) Proc. Natl. Acad. Sci. USA 92:6723-6727) and on variation in animal and plant taxa (Bruford et al. (1993) Curr. Biol. 3:939-943).

Reduction in stutter bands in accordance with the present invention is useful in all of the above applications, which are illustrative and not limiting, because, inter alia, the interpretation of the data is facilitated by the method of the invention. The methods of the invention may be used in conjunction with the methods described in the references cited herein, the disclosure of each of which is incorporated herein by reference in its entirety. In particular, the methods of the invention will simplify analyses of forensic samples, and therefore find particular utility in the forensic field.

The invention will be further described using the following actual examples, which are merely illustrative of some embodiments of the invention. The examples should not be construed in any way to limit the scope of the invention, which is defined by the appended claims.

EXAMPLES Example 1

PCR reactions were used to amplify DNA fragments containing mono-, di-, or tetranucleotide microsatellites in the presence or absence of betaine or sorbitol (FIGS. 1-6). The 50 μl reactions contained 20 mM Tris, 20 mM ammonium sulfate, 4.5 mM magnesium sulfate 1 mM each of dATP, dGTP, dCTP and dTTP, 12 pmoles oligonucleotide primers, 2 ng human genomic DNA template, 5 U AmpliTaq Gold® DNA polymerase in water. Reactions contained 3.0 M, 2.5 M, or 2.0 M betaine or 2.0 M sorbitol. The reaction conditions were as follows: 95° C. for 11 minutes, followed by 28 cycles of 94° C. for 30 seconds, 55° C. for 4 minutes and 69° C. for 6 minutes, followed by incubation at 60° C. for 45 minutes. Following PCR, samples were mixed with ROX fluorescent size marker (Applied Biosystems). Samples were denatured and separated using a protocol for capillary electrophoresis prepared for an ABI 310 Automated Genetic Analyzer according to the manufacturer's specifications. Fragment analysis data was analyzed with GeneScan Software from Applied Biosystems. For each set of peaks observed (see FIGS. 1-4), each peak corresponds to a fluorescence labeled, single stranded DNA molecule. The position of the peak on the x axis corresponds to the migration time, and therefore, the length of the DNA molecule. The height of each peak corresponds to the relative amount of PCR product for this length, as determined by relative fluorescence intensity measured in arbitrary units. A stutter peak for a mononucleotide repeat ((A)_(n) of BAT-25), is found to the left of the main peak (where the “main peak” is the allele). For a dinucleotide repeat ((CA)_(n) of D6S1581), the stutter peak is found two base pairs to the left of the allele peak. For a ((CTTT)_(n) tetranucleotide repeat of FGA, the stutter peak is four base pairs to the left of the allele peak. A reduction in the height of the stutter peak relative to the main peak is indicative of a reduction in stutter. When measuring stutter reduction, the height of the stutter peak over the height of the main peak×100 provides the percent stutter. If there are multiple stutter peaks, they are not taken collectively. Rather the height of the main stutter peak over the height of the allele peak×100 gives the percent stutter. The results for the reactions with betaine, sorbitol, and control samples (no betaine or sorbitol) are shown in FIGS. 1-7 and are summarized in Table 2:

TABLE 2 Results of betaine and sorbitol on amplification of mono-, di- and tetranucleotide microsatellites % Stutter Microsatellite Control +Betaine Control +Sorbitol Mononucleotide 175 65 (−63%) 173 62 (−65% Dinucleotide 28 14 (−50%) 49 35.5 (−30%)  Tetranucleotide 6.1 3.0 (−50%)  7.5 3.1 (−60%) * percent stutter = Peak height of stutter signal/peak height of allele signal × 100; Percentage in parentheses is percent reduction of stutter of modified conditions over control.

The reference works, patents, patent applications, and scientific literature, and other printed publications, including accession numbers to GenBank database sequences, that are referred to herein are hereby incorporated by reference in their entirety.

As those skilled in the art will appreciate, numerous changes and modifications may be made to the embodiments of the invention without departing from the spirit of the invention. It is intended that all such variations fall within the scope of the invention. 

What is claimed is:
 1. A method of personal genetic identification comprising amplifying a plurality of regions of DNA from a sample containing DNA from a subject, wherein said regions comprise at least one microsatellite selected from the group consisting of a tetranucleotide repeat and a pentanucleotide repeat, wherein said microsatellite has a G+C content of 50% or less, wherein said amplification comprises the steps of: (a) contacting said DNA with at least one enzyme having nucleic acid polymerase activity; (b) incubating said sample with said enzyme for a time and under conditions sufficient to amplify said regions; (c) separating said amplified regions, forming a microsatellite pattern; and (d) comparing said microsatellite pattern with a corresponding microsatellite pattern derived from a DNA sample from a second source; wherein said incubation is performed in the presence of an amount of an additive selected from the group consisting of sorbitol, betaine and mixtures thereof, wherein said additive is effective to reduce the amount of stutter arising from said amplification relative to the amount of such stutter observed in the absence of said additive.
 2. The method of claim 1 wherein said incubation is performed in the presence of a set of dNTPs, said set comprising a first dNTP corresponding to dATP, a second dNTP corresponding to dGTP, a third dNTP corresponding to dCTP, and a fourth dNTP corresponding to dTTP; wherein each of said dNTP is present at a concentration of at least 0.5 mM.
 3. The method of claim 1 wherein said additive is present at a concentration of 1.5 to 3.5 M.
 4. The method of claim 1 wherein said additive is present at a concentration of 2.0 to 3.0 M.
 5. The method of claim 2 wherein each of said dNTPs is present at a concentration of 0.5 mM to about 2.0 mM.
 6. The method of claim 5 wherein each of said dNTPs is present at a concentration from about 1.0 mM to about 2.0 mM.
 7. The method of claim 1 wherein the additive is betaine.
 8. The method of claim 1 wherein the additive is sorbitol.
 9. The method of claim 1 wherein the sample is a forsensic sample.
 10. The method of claim 9 wherein said second source comprises at least one selected from the group consisting of the presumed matching source, a family member of the presumed matching source, and a database of sources.
 11. The method of claim 1 wherein said regions of DNA comprise at least one genetic locus selected from the group consisting of D3S1358, VWA, D16S539, D8S1179, D21S11, D18S51, D19S433, TH01, FGA, D7S820, D13S317, D5S818, CSF1PO, TPOX, CD4, CYAR04, F13A01, F13B, FABP, FES/FPS, HPRTB, LPL, Penta D, Penta E, PLA2A1, D1S1656, D2S1242, D3S1359, D3S1744, D6S477, D8S347, D8S639, D9S302, D10S2325, D11S488, D11S554, D12S391, D12S1090, D18S535, D18S849, D20S161, D22S683, DXS6807, D19S433, and D10S2325.
 12. The method of claim 1 wherein said subject is human, bovine, dog, or horse.
 13. The method of claim 1 wherein said stutter is reduced to 60% or less than the amount of stutter obtained in the absence of said additive.
 14. The method of claim 1 wherein said stutter is reduced to 50% or less than the amount of stutter obtained in the absence of said additive.
 15. The method of claim 1 wherein the polymerase is selected from the group consisting of Taq DNA polymerase, Tth DNA polymerase, Tne Bst DNA polymerase large fragment from Bacillus stearothennophilus, DNA polymerase from Thermococcus litoralis, Tma from Thennotoga maritiama, and Pfu from Pyrococcus. 